USER REQUEST PROCESSING METHOD AND APPARATUS USING UPSTREAM 
CHANNEL IN INTERACTIVE MULTIMEDIA CONTENTS SERVICE 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to a system providing bidirectional 
communication services, and more particularly, to a user request processing method 
and apparatus using an upstream channel in interactive multimedia contents 
services. 

2. Description of the Related Art 

In order to provide various functions for interactive multimedia contents 
services, in the field for providing bidirectional communication services, for example, 
in the field of moving picture expert group (MPEG}-4 system (ISO/IEC JTG 1/SG 
29AA/G1 1 14496-1 ), various technologies for enabling those services are being 
standardized. To satisfy various requirements from users who receive multimedia 
contents services, those systems are providing functions for each multimedia 
contents provider processing various events occurring at users sides, through an 
upstream channel of each multimedia contents provider. 

Between a server (including an encoder) for encoding and managing data on 
arbitrary object information, and a user terminal (including a decoder) for receiving 
and restoring the data, two types of channels for transmitting data exist including a 
downstream channel for transmitting data from the server to the user terminal and 
an upstream channel for transmitting data from the user terminal to the server. 
When user's various requests should be processed frequently as in video-on- 
demand (VOD) or World Wide Web (WWW) environment, an upstream channel are 
used. That is, the upstream channel is used in a system for providing bidirectional 
communication services, in which necessary information is immediately provided 
from a server for services that cannot be processed in a user terminal. 

For data, in which errors occurred, on an application which provides only 
one-way telecommunications services, the processing method from the system side 
is different from that from the user side. On the system side, the size of a 
transmission data packet on a channel is checked, or whether or not data error 
occurred is checked according to the data format types such as a parity bit, in a 



lower layer of a network, such as a TCP/IP layer. If an error occurred, the server 
receives the data again for processing the error. However, in this method, it is a 
problem that logical errors occurred in transmission data or retransmission service 
in order of importance of data to be restored cannot be provided from the system 
side. Therefore, to consider this problem, a system for providing bidirectional 
communication service is needed, in which the user terminal checks errors in the 
restoration process, and if an error exists, transmits related information for restoring 
original data to the server through an upstream channel, and then the server 
restores error-containing data, retransmits the data to the user terminal so that the 
original data can be restored and errors can be processed. 

Meanwhile, an MPEG-4 system providing bidirectional communication 
services enables to use an upstream channel provided by each multimedia content 
provider in processing various events occurring in a user terminal. However, more 
specifically, for example, in various interactive multimedia contents services such as 
providing three-dimensional scenes for users, and receiving a request of processing 
some nodes in a scene from a user, the MPEG-4 system does not propose how to 
process the user request using the upstream channel. 

SUMMARY OF THE INVENTION 
To solve the above problems, it is an object of the present invention to 
provide a user request processing method and apparatus using an upstream 
channel, in which events of user requests on partial elements in arbitrary multimedia 
contents provided for the user as interactive multimedia contents services can be 
properly handled. 

It is another object to provide a user request processing method and 
apparatus using an upstream channel, in which events of user requests on some 
nodes in a scene provided for the user in MPEG-4 binary format for scene (BIFS) 
format can be properly handled. 

To accomplish the above object of the present invention, there is provided a 
user request processing method, using an upstream channel, after a three- 
dimensional scene generated based on a binary format, is transmitted from a server 
to a terminal, the user request processing method having the steps of (a) setting 



downstream/upstream channels between the server and the terminal as 
initialization; (b) the terminal forming an upstream channel message if a user 
request of predetermined processing of a predetermined object is occurred in a 
scene transmitted from the server to the terminal through the downstream channel, 
and transmitting the message to the server through the upstream channel; (c) the 
server receiving the upstream channel message, interpreting the message, 
processing the message as the user request of predetermined processing, and 
transmitting the result to the terminal; and (d) the terminal substituting the 
processing result of step (c) for the predetermined object in the scene transmitted in 
step (b), and providing it to the user. 

To accomplish another object of the present invention, there is also provided 
a user request processing apparatus using an upstream channel in a system 
providing bidirectional communication services, the user request processing 
apparatus having a server for transmitting through a downstream channel a three- 
dimensional scene generated based on a binary format, receiving and interpreting 
an upstream channel message, and processing the message as user's request of 
predetermined processing; and a terminal for forming an upstream channel 
message if a user request of predetermined processing for a predetermined object 
in the scene transmitted from the server occurs, and transmitting the message to the 
server through an upstream channel. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above objects and advantages of the present invention will become more 
apparent by describing in detail preferred embodiments thereof with reference to the 
attached drawings in which: 

FIG. 1 is a flowchart for explaining a user request processing method 
according to the present invention; 

FIG. 2 illustrates the structure provided on an MPEG-4 system for setting an 
upstream channel function; 

FIG. 3 is a block diagram for showing a user request processing apparatus 
according to the present invention; 



FIG. 4 is a block diagram for a preferable embodiment of a terminal forming 
upstream channel information according to the present invention; 

FIG. 5 is a block diagram for a preferable embodiment of a server processing 
upstream channel information according to the present invention; 

FIGS. 6A through 6D illustrate an example of a process for processing a user 
request of a scene description stream, using an upstream channel; and 

FIGS. 7A and 7B illustrate the syntax of upstream channel information and 
the syntax of related information. 

DETAILED DESCRIPTION OF THE INVENTION 
Hereinafter, embodiments of the present invention will be described in detail 
with reference to the attached drawings. The present invention is not restricted to 
the following embodiments, and many variations are possible within the spirit and 
scope of the present invention. The embodiments of the present invention are 
provided in order to more completely explain the present invention to anyone skilled 
in the art. 

The same reference numbers refer to the same respective elements 
throughout the drawings. 

FIG. 1 is a flowchart for explaining a user request processing method 
according to the present invention. 

FiG. 1 explains a user request processing method using an upstream 
channel, as a preferable embodiment of the present invention, when a scene, which 
is generated based on MPEG-4 BIFS describing scene nodes for expressing three- 
dimensional scene on an MPEG-4 system, is transmitted from a server to a user 
terminal. In the present invention, a scene can be a still picture of one frame 
containing objects, or moving pictures or a series of pictures formed of a plurality of 
frames. Here, the MPEG-4 BIFS in the MPEG-4 system is just one example. The 
present invention is used in all bidirectional communication service systems for 
providing interactive multimedia contents services, and more characteristically, is 
used in various systems providing contents of scenes generated based on various 
binary formats for describing scene nodes. 



FIG. 3 is a block diagram for showing an embodiment of a user request 
processing apparatus according to the present invention, and the apparatus has a 
server 300 and a terminal 310, both of which are connected to each other through a 
network, and the terminal 310 contains a decoder 320 and a user interface 330. 
Here, when the user request processing apparatus according to the present 
invention is implemented so as to provide an upstream service function for properly 
handling user requests on an MPEG-4 BIFS scene in an MPEG-4 system, the server 
300 has an MPEG-4 scene encoder, and the decoder 320 has an MPEG-4 scene 
decoder. The user request processing apparatus according to the present invention 
forms upstream channel information (Upstream_Channel_Message()) in the terminal 
310 so as to guarantee the integrity of a BIFS scene in the MPEG-4 system, and 
transmits the information to the server 300 through an upstream channel. The 
server 300 processes scene nodes selected in the subject BIFS scene according to 
the processing type requested by the terminal 310. 

The operation of the present invention will now be explained in detail 
referring to FIGS. 1 and 3. 

First, if the server 300 and the terminal 31 0 are connected to each other 
through a network, as an initial step, downstream/upstream channels are set so that 
information can be transmitted between the server 300 and the terminal 310, in step 
100. Particularly, the present invention uses a structure provided on an MPEG-4 
system, as shown in FIG. 2, so that an upstream channel function for transmitting 
information from a terminal to a server is set to satisfy a user request. 

Referring to FIG. 2, an initial object descriptor 200 Is formed of four types of 
elementary stream descriptors (ES_Descriptor). The four types of ES_Descriptors 
describes a scene description stream 220, which describes initial information to be 
transmitted from a server to a terminal, that is, descriptive information of objects 
forming a scene, the location of object information, relations between objects, and 
its upstream 222, and an object descriptor stream 224, which shows descriptors for 
data streams, which will actually receive information on each object, and the types 
: of respective objects in a scene, and its upstream 226. 

As shown in FIG. 2, based on the initial object descriptor 200, four stream 
channels 220 through 226 are set in the terminal. Each ES_Descriptor 212 and 214 



is connected to appropriate streams, using an elementary stream identifier (ES_ID) 
which can distinguish each stream. Therefore, using ES_IDs contained in the initial 
object descriptor transmitted from the server, the terminal identifies a scene 
description stream and an object descriptor stream, which are transmitted through 
the network channel. 

More specifically, a scene description stream contains node identifiers for 
each object forming a scene, temporal/spatial information on each object, and 
information on correlation of respective objects. Also, the scene description stream 
contains object descriptor ID (OD_ID), which is a pointer to the actual location of 
descriptive information on each object. Using this ODJD, the terminal identifies 
information on each object in an object descriptor stream, and connects the object 
structure in an MPEG-4 BIFS scene. The upstream for the scene description stream 
means one upstream channel for all objects forming an MPEG-4 BIFS scene. In the 
structure of the ES-descriptor 214, describing the upstream in the initial object 
descriptor, upStream flag is initialized at "1" and streamDependenceflag is initialized 
at "1" so as to indicate that ES_descriptor 214 is dependent on ES_descriptor 212. 
The terminal checks that upStream flag is "1" and then specifies that one channel is 
an upstream channel for transmitting information to the server. Then, using 
StreamDependenceflag, it is specified that the upstream 222 is about a scene 
description stream providing information which can be currently used. The 
upstream 222 can appropriately form information to be transmitted to the server, 
using all information related to the entire scene described by the referred scene 
description stream. Each object descriptor in an object descriptor stream is formed 
of a plurality of ES_Descriptors, including ESJDs which can distinguish streams 
actually having descriptive information on each unit object, upstream information on 
whether or not each object has an upstream channel from the terminal to the server, 
and encoding information on objects being transmitted from the terminal to the 
server. The terminal identifies elementary streams containing actual object 
information, using ESJD information in ES_Descriptor, and determines the 
presence of an upstream channel for each object, using upstream information. 

Therefore, the MPEG-4 system supports the user side so that the user can 
transmit various information to the server using the upstream channel 222 shown in 
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FIG. 2. Using this structure, the present invention, which will be explained, 
discloses methods for generating and forming upstream channel information needed 
in processing various user requests transmitted from the terminal to the server, a 
method for server interpreting information received from the terminal, responding to 
the upstream channel information, and a method for appropriately generating 
response information to be transmitted to the terminal using the interpreting method. 

Referring to FIG. 1 again, after step 100, information on a BIFS scene is 
transmitted from the server 300 to the terminal 310 in step 110. This step is 
performed by information transmitting and processing method commonly known 
between the server and the terminal. Next, the scene is provided to the user by 
displaying the scene at the terminal 310 in step 120. 

Next, whether or not a user request of processing a predetermined node of a 
predetermined object in the scene occurred is determined in step 130. If so, 
upstream channel information for predetermined processing is formed and 
transmitted to the server 300 through the upstream channel in step 140. 

In FIG. 3, the decoder 320 interprets the scene description stream 
transmitted from the server 300 so that the result can be displayed through the user 
interface 330 including an indicator. The user interface 330 passes the user request 
on displayed information to the decoder 320. Since the decoder 320 has 
information on the currently transmitted scene, the decoder 320 can figure out 
which node in the scene corresponds to the user request. Together with a 
command corresponding to the user request, the corresponding node identifier 
(Node ID) of the node interpreted in the decoder 320 forms upstream channel 
information (Upstream_Channel_Message ()). 

Next, the server 300 receives and interprets upstream channel information, 
processes it as user's request of predetermined processing, and transmits the result 
to the terminal 310 in step 150. Finally, the terminal 310 provides newly sybstituted 
node(s) for the user, by displaying node(s), which was processed in step 150 
according to the user's request, in step 160. 

FIG. 4 is a block diagram for a preferable embodiment of the terminal 310, 
which forms upstream channel information in step 140, according to the present 
invention. 



Referring to FIG. 4, step 140 described above will now be explained in detail. 

If a user watches the display screen and selects a predetermined object, a 
user request (user_request) for a predetermined object in the entire BIFS scene is 
generated and transmitted through a user interface 400. A note interpreter 41 0 
defines a corresponding node in the current scene in which the user request is 
generated, using a node identifier (NodelD) according to the location of a node used 
in the scene or sequence information, etc. 

By the presence of a NodelD, a NodelD presence determiner 412 determines 
whether or not the defined node is a reusable node in the scene. The NodelD 
presence determiner 412 are informed in advance of a NodelD for each object in the 
scene, as information on the entire BIFS scene containing the predetermined node. 
Basically, In the scene transmitted from the server 300 to the terminal 310, a proper 
identifier, such as a NodelD, is assigned to and used in a node for each object, so 
as to provide reusability, that is, a function capable of responding to a user request. 
At this time, whether or not a NodelD is assigned Is determined by the use of a 
DEFinition (DEF) command. The DEF command is defined in a scene description. 
For example, if a node is not defined by a DEF command, that is, it is not allowed to 
reuse the node in the scene, a NodelD for the node is not assigned in the scene. 

If a NodelD assigned for reuse is not in a node defined by the node 
interpreter 410, a NodelD generator 414 finds out a node, for which a NodelD is 
defined, among the nodes immediately above the node defined in the BIFS scene, 
and process the NodelD of the above node so as to be used as the NodelD of the 
node selected by the user. Also, if reuse is not allowed for all nodes used in the 
scene, for example, a NodelD is defined as a value less than "0" so that all nodes, 
including the highest node, contained in the scene can be processed in the server. 

Meanwhile, a command generator 420 defines a command to be processed in 
the server, for a node defined in the node interpreter 410, appropriately to the user 
request. With commands such as retransmission, deletion, or insertion of the 
requested node, predetermined commands for processing the node are defined, 
considering the characteristic of the node. An upstream channel mtessage 
transmitter 430 forms an upstream channel message by combining the NodelD 
defined as described above in the node interpreter 300 and the commands defined 
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in the command generator 315, and transmits the upstream channel message to the 
server through the upstream channel. 

FIGS. 7A and 7B illustrate the syntax of upstream channel information and 
the syntax of related information. 

FIG. 5 is a block diagram for a preferable embodiment of the server 300, 
which processes the upstream channel message in step 150, according to the 
present invention. To process the upstream channel message, the server 300 has 
an upstream channel message receiver 500, a node interpreter 505, a command 
processor 510, and a node encoder 515. 

Referring to FIG. 5, step 150 described above will now be explained in detail. 

First, the upstream channel message receiver 500 receives an upstream 
channel message (Upsream_Channel_Message()) through the upstream channel. 
The node interpreter 505 confirms a NodelD in the upstream channel message. 
Then, the node interpreter 505 confirms whether or not the confirmed NodelD is for 
a node contained in the BIFS scene, the structure of the subject node indicated by 
the NodelD, and information on nodes directly dependent on this node. 

Subject nodes to be processed are defined by a NodelD. When necessary, 
the node indicated by the NodelD and nodes dependent thereon are defined as 
subject nodes. At this time, if the confirmed NodelD does not exist in the BIFS 
scene to be processed, the node interpreter 505 stops execution. If the confirmed 
NodelD is for all nodes in the BIFS scene, the all nodes in the scene are defined as 
subject nodes. 

If subject nodes are defined as described above, the command processor 510 
confirms node commands to be executed in the server, in the upstream channel 
message receiver 500, and executes node operations for processing subject nodes, 
according to the node commands. The subject nodes processed in the server 
according to node commands are formed into a BIFS message in the node encoder 
515, and transmitted to the terminal through the downstream channel. 

So far, the structure and operation of the present invention have been 
explained. The present invention, for example, if a user request for retransmission 
of each predetermined object is transmitted to the server through the upstream 
channel, processes in the server only predetermined subject nodes to be 
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transmitted so that the nodes can be transmitted to the terminal. By doing so, 
restoration in the terminal of predetermined objects, in which errors occurred, can be 
quickly performed and information volume transmitted for the restoration can be 
minimized. 

FIGS. 6A through 6D illustrates an example of a process for processing a 
user request of a scene description stream, using an upstream channel. 

FIG. 6A illustrates an example of transmitting, to the terminal, the NodelD of 
each node forming a scene, temporal/spatial information on each object, and 
information on correlation among each object, using a scene description stream in a 
server having an MPEG-4 BIFS scene encoder. The server can transmit the 
information described above, all or partially. The decoder of the terminal interprets 
the information so that the user can watch the information on a display screen. 

FIG. 6B illustrates an example of displaying nodes in an MPEG-4 BIFS scene 
containing an error, in a terminal having an MPEG-4 BIFS scene decoder, when 
transmission errors occur in transmitting nodes forming a scene through the 
downstream channel. The error occurred at the node whose NodelD is "5". The 
user watches the display screen and selects the object, in which the error occurred, 
using an input device such as a mouse. To retransmit to the user only the node 
selected by the user, the terminal forms an upstream channel message, for 
example, "retransmit NodelD 5", and transmits the message to the server. Even 
when an error occurred at an arbitrary node in a layer lower than the layer 
containing the node whose NodelD is "5", if a NodelD is not assigned to the 
arbitrary node of the lower layer, the terminal also transmits "retransmit NodelD 5". 

FIG. 6C illustrates an example of the server processing a scene when the 
terminal requests the server to retransmit a predetermined node. Using the NodelD 
of the corresponding object, the server finds out the corresponding node in the 
hierarchical structure of the MPEG-4 BIFS scene, and transmits the corresponding 
node, and if necessary, information on the corresponding node and all nodes of 
lower levels. For example, if an error occurred at a node whose NodelD is "5", the 
terminal needs only the corresponding node information, but if an error occurred at 
an arbitrary node of a level, lower than the node whose NodelD is "5", the terminal 
needs information on the corresponding node and all nodes of lower levels. 
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FIG. 6D shows the result of displaying a scene when the terminal received 
information on a node, in which an error occurred, from the server. The terminal 
displays the scene as shown in FIG. 6D, by substituting information on retransmitted 
node in the display screen shown in FIG. 6B. 

According to the present invention, by properly handing events of user 
requests on partial elements in an arbitrary multimedia content provided for the user 
as interactive multimedia contents services, predetermined elements can be 
selectively processed. Therefore, information volume to be transmitted for 
restoration from an error can be minimized, and the terminal can quickly receive 
restored information. Particularly when an error occurred in an MPEG-4 BIFS 
scene, only predetermined nodes can be selectively processed in the MPEG-4 
system, and therefore, information volume to be transmitted for restoration from the 
error can be reduced and the terminal can quickly receive restored information. 
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